NSF PAR Search | NSF Public Access Repository

Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL

Wagenmaker, Andrew; Huang, Kevin; Ke, Liyiming; Jamieson, Kevin; Gupta, Abhishek (April 2025, Curran Associates, Inc.)
Globerson, A; Mackey, L; Belgrave, D; Fan, A; Paquet, U; Tomczak, J; Zhang, C (Ed.)
Free, publicly-accessible full text available April 1, 2026
Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning

Narang, Adhyyan; Wagenmaker, Andrew; Ratliff, Lillian; Jamieson, Kevin (December 2024, Neural Information Processing Systems)

Full Text Available
Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL

Wagenmaker, Andrew; Huang, Kevin; Ke, Liyiming; Boots, Byron; Jamieson, Kevin; Gupta, Abhishek (December 2024, Conference on Neural Information Processing Systems)

Full Text Available
Active Learning of Neural Population Dynamics Using Two-Photon Holographic Optogenetics

Wagenmaker, Andrew; Mi, Lu; Rozsa, Marton; Bull, Matthew S; Svoboda, Karel; Daie, Kayvon; Golub, Matthew D; Jamieson, Kevin (December 2024, Neural Information Processing Systems)

Full Text Available
Optimal Exploration for Model-Based RL in Nonlinear Systems

Wagenmaker, Andrew; Shi, Guanya; Jamieson, Kevin (December 2023, Advances in Neural Information Processing Systems)

Full Text Available
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

https://doi.org/10.52202/079017-3978

Chen, Jiayi; Guo, Yang; Jain, Lalit; Jamieson, Kevin; Mankoff, Robert; Nowak, Robert; Rogers, Timothy; Sievert, Scott; Suresh, Siddharth; Wagenmaker, Andrew; et al (January 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

Full Text Available
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design

Wagenmaker, Andrew; Jamieson, Kevin (January 2022, Advances in Neural Information Processing Systems)
Koyejo, S.; Mohamed, S.; Agarwal, A.; Belgrave, D.; Cho, K.; Oh, A. (Ed.)
While much progress has been made in understanding the minimax sample complexity of reinforcement learning (RL)—the complexity of learning on the “worst-case” instance—such measures of complexity often do not capture the true difficulty of learning. In practice, on an “easy” instance, we might hope to achieve a complexity far better than that achievable on the worst-case instance. In this work we seek to understand the “instance-dependent” complexity of learning near-optimal policies (PAC RL) in the setting of RL with linear function approximation. We propose an algorithm, Pedel, which achieves a fine-grained instance-dependent measure of complexity, the first of its kind in the RL with function approximation setting, thereby capturing the difficulty of learning on each particular problem instance. Through an explicit example, we show that Pedel yields provable gains over low-regret, minimax-optimal algorithms and that such algorithms are unable to hit the instance-optimal rate. Our approach relies on a novel online experiment design-based procedure which focuses the exploration budget on the “directions” most relevant to learning a near-optimal policy, and may be of independent interest.
Full Text Available
Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

Wagenmaker, Andrew; Simchowitz, Max; Jamieson, Kevin (January 2022, Proceedings of Machine Learning Research)

Full Text Available
Active Learning with Safety Constraints

Camilleri, Romain; Wagenmaker, Andrew; Morgenstern, Jamie; Jain, Lalit; Jamieson, Kevin (January 2022, Advances in Neural Information Processing Systems)

Full Text Available
